Building a neural speech recognizer for quranic recitations

نویسندگان

چکیده

This work is an effort towards building Neural Speech Recognizers system for Quranic recitations that can be effectively used by anyone regardless of their gender and age. Despite having a lot available online, most them are recorded professional male adult reciters, which means ASR trained on such datasets would not female/child reciters. We address this gap adopting benchmark dataset audio records consists both genders from different ages. Using dataset, we build several speaker-independent NSR systems based the DeepSpeech model use word error rate (WER) evaluating them. The goal to show how tuned certain perform test set other gender. Unfortunately, number female in our rather small while much larger. In first experiments, avoid imbalance issue between two down-sample part match part. For subset results interesting with 0.968 WER when tested recitations. same gives 0.406 On hand, training testing it recitation 0.966 0.608 WER.

منابع مشابه

A Speech Recognizer Based on Locally Recurrent Neural Networks

Speech recognition systems (SRS) designed for applications in low cost products like telephones or in systems with energetic constraints like autonomous vehicles are faced with the demand for solutions with low complexity. A small vocabulary consisting of a few command words as well as the digits is suucient for most of the applications but has to be recognized robustly. Here we report about in...

متن کامل

A Virtual Vocabulary Speech Recognizer

A system for the automatic recognition of human speech is described. A commercially available speech recognizer sees its recognition vocabulary increased through the use of virtual memory management techniques. Central to the design are issues concerning the nature of speech, its effectiveness as an isolated mode of communication with computers, and its role as a part of a multi-modal communica...

متن کامل

Towards Quranic reader controlled by speech

In this paper we describe the process of designing a task-oriented continuous speech recognition system for Arabic, based on CMU Sphinx4, to be used in the voice interface of Quranic reader. The concept of the Quranic reader controlled by speech is presented, the collection of the corpus and creation of acoustic model are described in detail taking into account a specificities of Arabic languag...

متن کامل

Neural Speech Recognizer: Acoustic-to-Word LSTM Model for Large Vocabulary Speech Recognition

We present results that show it is possible to build a competitive, greatly simplified, large vocabulary continuous speech recognition system with whole words as acoustic units. We model the output vocabulary of about 100,000 words directly using deep bi-directional LSTM RNNs with CTC loss. The model is trained on 125,000 hours of semi-supervised acoustic training data, which enables us to alle...

متن کامل

Improving Speech Recognizer Performance

This thesis investigates N-best hypotheses reranking techniques for improving speech recognition accuracy. We have focused on improving the accuracy of a speech recognizer used in a dialog system. Our post-processing approach uses a linear regression model to predict the error rate of each hypothesis from hypothesis features, and then outputs the one that has the lowest (recomputed) error rate....

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: International Journal of Speech Technology

سال: 2022

ISSN: ['1381-2416', '1572-8110']

DOI: https://doi.org/10.1007/s10772-022-09988-3